A New Load Balancing Approach for Parallel FP-Growth
نویسندگان
چکیده
Due to the exponential growth in worldwide information, companies have to deal with an ever growing amount of digital information. So the huge size of data and computation volume of new processing applications such as data mining, leads to new high performance parallel processing systems. One of the most important challenges of such application is quickly and correctly finding the relationship between available data. In this paper we target to reach the main goal of parallel processing which is high utilization of processing elements via load balancing and reducing the processing time. To reach an optimum load balancing algorithm, our proposed approach adopts a heuristic based on the weights of frequent item sets. Different scenarios of proposed method are considered and the processing time of those scenarios are obtained by GRIDSIM simulator. Finally, performance metrics of the proposed method such as computation time, Hit ratio of data etc... are extracted through simulation results. Keyword: load balancing, parallel computing, FP-Growth, frequent itemset mining
منابع مشابه
Load Balancing Approach Parallel Algorithm for Frequent Pattern Mining
Association rules mining from transaction-oriented databases is an important issue in data mining. Frequent pattern is crucial for association rules generation, time series analysis, classification, etc. There are two categories of algorithms that had been proposed, candidate set generate-and-test approach (Apriori-like) and Pattern growth approach. Many methods had been proposed to solve the a...
متن کاملParleda: a Library for Parallel Processing in Computational Geometry Applications
ParLeda is a software library that provides the basic primitives needed for parallel implementation of computational geometry applications. It can also be used in implementing a parallel application that uses geometric data structures. The parallel model that we use is based on a new heterogeneous parallel model named HBSP, which is based on BSP and is introduced here. ParLeda uses two main lib...
متن کاملAn Improved Technique Of Extracting Frequent Itemsets From Massive Data Using MapReduce
The mining of frequent itemsets is a basic and essential work in many data mining applications. Frequent itemsets extraction with frequent pattern and rules boosts the applications like Association rule mining, co-relations also in product sale and marketing. In extraction process of frequent itemsets there are number of algorithms used Like FP-growth,E-clat etc. But unfortunately these algorit...
متن کاملAn Incremental Parallel Scheduling Approach to Solving Dynamic and Irregular Problems
|Global parallel scheduling is a new approach for runtime load balancing. In parallel scheduling , all processors are cooperated together to schedule work. Parallel scheduling accurately balances the load by using global load information. As an alternative strategy to the commonly used dynamic scheduling, it provides a high-quality, low-overhead load balancing. This paper presents a parallel sc...
متن کاملLoad Balancing Approaches for Web Servers: A Survey of Recent Trends
Numerous works has been done for load balancing of web servers in grid environment. Reason behinds popularity of grid environment is to allow accessing distributed resources which are located at remote locations. For effective utilization, load must be balanced among all resources. Importance of load balancing is discussed by distinguishing the system between without load balancing and with loa...
متن کامل